Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 546651 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 84918 |
| Duplicate rows (%) | 15.5% |
| Total size in memory | 70.9 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 8 |
| Dataset has 84918 (15.5%) duplicate rows | Duplicates |
signup_flow is highly correlated with affiliate_channel and 1 other fields | High correlation |
days_from_account_created_until_first_booking is highly correlated with day_first_booking and 2 other fields | High correlation |
day_first_booking is highly correlated with days_from_account_created_until_first_booking and 2 other fields | High correlation |
day_of_week_first_booking is highly correlated with day_of_week_account_created | High correlation |
year_account_created is highly correlated with days_from_account_created_until_first_booking and 1 other fields | High correlation |
day_account_created is highly correlated with day_first_booking | High correlation |
day_of_week_account_created is highly correlated with day_of_week_first_booking | High correlation |
week_of_year_account_created is highly correlated with year_account_created | High correlation |
affiliate_channel is highly correlated with signup_flow and 2 other fields | High correlation |
first_affiliate_tracked is highly correlated with affiliate_channel | High correlation |
signup_app is highly correlated with signup_flow and 1 other fields | High correlation |
country_destination is highly correlated with days_from_account_created_until_first_booking and 1 other fields | High correlation |
days_from_first_active_until_account_created is highly skewed (γ1 = 67.7710734) | Skewed |
signup_flow has 440570 (80.6%) zeros | Zeros |
days_from_first_active_until_account_created has 545687 (99.8%) zeros | Zeros |
days_from_account_created_until_first_booking has 122231 (22.4%) zeros | Zeros |
day_of_week_first_booking has 127650 (23.4%) zeros | Zeros |
day_of_week_account_created has 89920 (16.4%) zeros | Zeros |
Reproduction
| Analysis started | 2022-09-09 20:13:41.933881 |
|---|---|
| Analysis finished | 2022-09-09 20:14:27.694805 |
| Duration | 45.76 seconds |
| Software version | pandas-profiling v3.2.0 |
| Download configuration | config.json |
| Distinct | 26 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.79915522 |
| Minimum | 0 |
|---|---|
| Maximum | 25 |
| Zeros | 440570 |
| Zeros (%) | 80.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 18 |
| Maximum | 25 |
| Range | 25 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.542365681 |
|---|---|
| Coefficient of variation (CV) | 3.08053781 |
| Kurtosis | 10.61813579 |
| Mean | 1.79915522 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.431729962 |
| Sum | 983510 |
| Variance | 30.71781734 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 440570 | |
| 1 | 23979 | 4.4% |
| 2 | 20907 | 3.8% |
| 25 | 13496 | 2.5% |
| 3 | 11125 | 2.0% |
| 12 | 9423 | 1.7% |
| 24 | 8033 | 1.5% |
| 23 | 3111 | 0.6% |
| 5 | 1616 | 0.3% |
| 4 | 1605 | 0.3% |
| Other values (16) | 12786 | 2.3% |
| Value | Count | Frequency (%) |
| 0 | 440570 | |
| 1 | 23979 | 4.4% |
| 2 | 20907 | 3.8% |
| 3 | 11125 | 2.0% |
| 4 | 1605 | 0.3% |
| 5 | 1616 | 0.3% |
| 6 | 1460 | 0.3% |
| 7 | 1291 | 0.2% |
| 8 | 1395 | 0.3% |
| 9 | 1161 | 0.2% |
| Value | Count | Frequency (%) |
| 25 | 13496 | |
| 24 | 8033 | |
| 23 | 3111 | 0.6% |
| 22 | 628 | 0.1% |
| 21 | 834 | 0.2% |
| 20 | 426 | 0.1% |
| 19 | 432 | 0.1% |
| 18 | 443 | 0.1% |
| 17 | 469 | 0.1% |
| 16 | 429 | 0.1% |
| Distinct | 324 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2278418955 |
| Minimum | 0 |
|---|---|
| Maximum | 1456 |
| Zeros | 545687 |
| Zeros (%) | 99.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 1456 |
| Range | 1456 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 10.34201202 |
|---|---|
| Coefficient of variation (CV) | 45.39117795 |
| Kurtosis | 5689.364061 |
| Mean | 0.2278418955 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 67.7710734 |
| Sum | 124550 |
| Variance | 106.9572126 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 545687 | |
| 1 | 91 | < 0.1% |
| 2 | 76 | < 0.1% |
| 4 | 63 | < 0.1% |
| 3 | 61 | < 0.1% |
| 5 | 16 | < 0.1% |
| 6 | 12 | < 0.1% |
| 13 | 12 | < 0.1% |
| 10 | 9 | < 0.1% |
| 16 | 8 | < 0.1% |
| Other values (314) | 616 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 545687 | |
| 1 | 91 | < 0.1% |
| 2 | 76 | < 0.1% |
| 3 | 61 | < 0.1% |
| 4 | 63 | < 0.1% |
| 5 | 16 | < 0.1% |
| 6 | 12 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 4 | < 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 1456 | 1 | |
| 1369 | 1 | |
| 1361 | 1 | |
| 1148 | 1 | |
| 1036 | 1 | |
| 1011 | 1 | |
| 1006 | 1 | |
| 993 | 1 | |
| 984 | 1 | |
| 962 | 1 |
| Distinct | 1971 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 116.1756386 |
| Minimum | -349 |
|---|---|
| Maximum | 2001 |
| Zeros | 122231 |
| Zeros (%) | 22.4% |
| Negative | 231 |
| Negative (%) | < 0.1% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | -349 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 6 |
| Q3 | 104 |
| 95-th percentile | 673 |
| Maximum | 2001 |
| Range | 2350 |
| Interquartile range (IQR) | 103 |
Descriptive statistics
| Standard deviation | 241.2298612 |
|---|---|
| Coefficient of variation (CV) | 2.076423803 |
| Kurtosis | 9.71243456 |
| Mean | 116.1756386 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.964137586 |
| Sum | 63507529 |
| Variance | 58191.84595 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 122231 | |
| 1 | 67733 | 12.4% |
| 2 | 30398 | 5.6% |
| 3 | 19576 | 3.6% |
| 4 | 14563 | 2.7% |
| 5 | 11686 | 2.1% |
| 6 | 10191 | 1.9% |
| 7 | 9405 | 1.7% |
| 8 | 7926 | 1.4% |
| 9 | 6553 | 1.2% |
| Other values (1961) | 246389 |
| Value | Count | Frequency (%) |
| -349 | 1 | |
| -347 | 1 | |
| -338 | 1 | |
| -308 | 1 | |
| -298 | 1 | |
| -295 | 1 | |
| -288 | 1 | |
| -273 | 1 | |
| -269 | 1 | |
| -261 | 1 |
| Value | Count | Frequency (%) |
| 2001 | 2 | |
| 1999 | 1 | |
| 1995 | 1 | |
| 1992 | 1 | |
| 1991 | 2 | |
| 1990 | 2 | |
| 1980 | 1 | |
| 1979 | 1 | |
| 1977 | 1 | |
| 1976 | 1 |
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.60590212 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 9 |
| median | 17 |
| Q3 | 25 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 9.02490633 |
|---|---|
| Coefficient of variation (CV) | 0.5434758235 |
| Kurtosis | -1.271097189 |
| Mean | 16.60590212 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | -0.09524150734 |
| Sum | 9077633 |
| Variance | 81.44893427 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 29 | 67560 | 12.4% |
| 5 | 17899 | 3.3% |
| 15 | 17753 | 3.2% |
| 16 | 17634 | 3.2% |
| 23 | 17531 | 3.2% |
| 13 | 17506 | 3.2% |
| 22 | 17502 | 3.2% |
| 3 | 17422 | 3.2% |
| 4 | 17410 | 3.2% |
| 25 | 17340 | 3.2% |
| Other values (21) | 321094 |
| Value | Count | Frequency (%) |
| 1 | 14825 | |
| 2 | 16249 | |
| 3 | 17422 | |
| 4 | 17410 | |
| 5 | 17899 | |
| 6 | 16645 | |
| 7 | 15697 | |
| 8 | 15892 | |
| 9 | 17216 | |
| 10 | 16644 |
| Value | Count | Frequency (%) |
| 31 | 2531 | 0.5% |
| 30 | 9169 | 1.7% |
| 29 | 67560 | |
| 28 | 14752 | 2.7% |
| 27 | 15209 | 2.8% |
| 26 | 16328 | 3.0% |
| 25 | 17340 | 3.2% |
| 24 | 16970 | 3.1% |
| 23 | 17531 | 3.2% |
| 22 | 17502 | 3.2% |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.223343596 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 127650 |
| Zeros (%) | 23.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.794642656 |
|---|---|
| Coefficient of variation (CV) | 0.8071818764 |
| Kurtosis | -0.9769581362 |
| Mean | 2.223343596 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.3564866122 |
| Sum | 1215393 |
| Variance | 3.220742261 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 127650 | |
| 2 | 94534 | |
| 1 | 93776 | |
| 3 | 85121 | |
| 4 | 71454 | |
| 5 | 53326 | |
| 6 | 20790 | 3.8% |
| Value | Count | Frequency (%) |
| 0 | 127650 | |
| 1 | 93776 | |
| 2 | 94534 | |
| 3 | 85121 | |
| 4 | 71454 | |
| 5 | 53326 | |
| 6 | 20790 | 3.8% |
| Value | Count | Frequency (%) |
| 6 | 20790 | 3.8% |
| 5 | 53326 | |
| 4 | 71454 | |
| 3 | 85121 | |
| 2 | 94534 | |
| 1 | 93776 | |
| 0 | 127650 |
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| 2013 | |
|---|---|
| 2012 | |
| 2014 | |
| 2011 | |
| 2010 | 6307 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2186604 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2011 |
|---|---|
| 2nd row | 2010 |
| 3rd row | 2011 |
| 4th row | 2010 |
| 5th row | 2010 |
Common Values
| Value | Count | Frequency (%) |
| 2013 | 222821 | |
| 2012 | 156289 | |
| 2014 | 112780 | |
| 2011 | 48454 | 8.9% |
| 2010 | 6307 | 1.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| 2013 | 222821 | |
| 2012 | 156289 | |
| 2014 | 112780 | |
| 2011 | 48454 | 8.9% |
| 2010 | 6307 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 702940 | |
| 1 | 595105 | |
| 0 | 552958 | |
| 3 | 222821 | 10.2% |
| 4 | 112780 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2186604 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 702940 | |
| 1 | 595105 | |
| 0 | 552958 | |
| 3 | 222821 | 10.2% |
| 4 | 112780 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2186604 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 702940 | |
| 1 | 595105 | |
| 0 | 552958 | |
| 3 | 222821 | 10.2% |
| 4 | 112780 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2186604 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 702940 | |
| 1 | 595105 | |
| 0 | 552958 | |
| 3 | 222821 | 10.2% |
| 4 | 112780 | 5.2% |
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.5483334 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.50659347 |
|---|---|
| Coefficient of variation (CV) | 0.5471064488 |
| Kurtosis | -1.186168633 |
| Mean | 15.5483334 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.007569097575 |
| Sum | 8499512 |
| Variance | 72.36213246 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 22 | 19863 | 3.6% |
| 24 | 19734 | 3.6% |
| 3 | 19600 | 3.6% |
| 16 | 19409 | 3.6% |
| 13 | 19394 | 3.5% |
| 23 | 19366 | 3.5% |
| 27 | 19218 | 3.5% |
| 15 | 19010 | 3.5% |
| 21 | 18862 | 3.5% |
| 25 | 18859 | 3.4% |
| Other values (21) | 353336 |
| Value | Count | Frequency (%) |
| 1 | 14078 | |
| 2 | 16999 | |
| 3 | 19600 | |
| 4 | 18343 | |
| 5 | 18510 | |
| 6 | 18057 | |
| 7 | 18149 | |
| 8 | 18014 | |
| 9 | 18745 | |
| 10 | 18327 |
| Value | Count | Frequency (%) |
| 31 | 3100 | 0.6% |
| 30 | 12196 | |
| 29 | 16014 | |
| 28 | 17644 | |
| 27 | 19218 | |
| 26 | 18294 | |
| 25 | 18859 | |
| 24 | 19734 | |
| 23 | 19366 | |
| 22 | 19863 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.457480184 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 89920 |
| Zeros (%) | 16.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.76668792 |
|---|---|
| Coefficient of variation (CV) | 0.718902204 |
| Kurtosis | -0.9641039054 |
| Mean | 2.457480184 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.2654354691 |
| Sum | 1343384 |
| Variance | 3.121186208 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 102184 | |
| 1 | 100371 | |
| 3 | 90626 | |
| 0 | 89920 | |
| 4 | 77429 | |
| 5 | 59675 | |
| 6 | 26446 | 4.8% |
| Value | Count | Frequency (%) |
| 0 | 89920 | |
| 1 | 100371 | |
| 2 | 102184 | |
| 3 | 90626 | |
| 4 | 77429 | |
| 5 | 59675 | |
| 6 | 26446 | 4.8% |
| Value | Count | Frequency (%) |
| 6 | 26446 | 4.8% |
| 5 | 59675 | |
| 4 | 77429 | |
| 3 | 90626 | |
| 2 | 102184 | |
| 1 | 100371 | |
| 0 | 89920 |
| Distinct | 53 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 24.03569188 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 14 |
| median | 23 |
| Q3 | 34 |
| 95-th percentile | 48 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 13.25802809 |
|---|---|
| Coefficient of variation (CV) | 0.5515975226 |
| Kurtosis | -0.8693094212 |
| Mean | 24.03569188 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 0.2749158467 |
| Sum | 13139135 |
| Variance | 175.7753089 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 19 | 17740 | 3.2% |
| 26 | 17626 | 3.2% |
| 23 | 17613 | 3.2% |
| 21 | 17335 | 3.2% |
| 25 | 16979 | 3.1% |
| 20 | 16292 | 3.0% |
| 24 | 16109 | 2.9% |
| 22 | 16053 | 2.9% |
| 18 | 15847 | 2.9% |
| 17 | 15054 | 2.8% |
| Other values (43) | 380003 |
| Value | Count | Frequency (%) |
| 1 | 5566 | |
| 2 | 6247 | |
| 3 | 9887 | |
| 4 | 9732 | |
| 5 | 9360 | |
| 6 | 11291 | |
| 7 | 11144 | |
| 8 | 11486 | |
| 9 | 11834 | |
| 10 | 11559 |
| Value | Count | Frequency (%) |
| 53 | 2 | < 0.1% |
| 52 | 3196 | 0.6% |
| 51 | 4667 | |
| 50 | 6017 | |
| 49 | 7086 | |
| 48 | 6855 | |
| 47 | 7338 | |
| 46 | 8032 | |
| 45 | 8015 | |
| 44 | 6734 |
gender
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| FEMALE | |
|---|---|
| MALE | |
| -unknown- | |
| OTHER | 1682 |
Length
| Max length | 9 |
|---|---|
| Median length | 6 |
| Mean length | 5.678718232 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3104277 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MALE |
|---|---|
| 2nd row | FEMALE |
| 3rd row | FEMALE |
| 4th row | -unknown- |
| 5th row | FEMALE |
Common Values
| Value | Count | Frequency (%) |
| FEMALE | 243863 | |
| MALE | 215453 | |
| -unknown- | 85653 | 15.7% |
| OTHER | 1682 | 0.3% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| female | 243863 | |
| male | 215453 | |
| unknown | 85653 | 15.7% |
| other | 1682 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 704861 | |
| M | 459316 | |
| A | 459316 | |
| L | 459316 | |
| n | 256959 | 8.3% |
| F | 243863 | 7.9% |
| - | 171306 | 5.5% |
| u | 85653 | 2.8% |
| k | 85653 | 2.8% |
| o | 85653 | 2.8% |
| Other values (5) | 92381 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2333400 | |
| Lowercase Letter | 599571 | 19.3% |
| Dash Punctuation | 171306 | 5.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 704861 | |
| M | 459316 | |
| A | 459316 | |
| L | 459316 | |
| F | 243863 | 10.5% |
| O | 1682 | 0.1% |
| T | 1682 | 0.1% |
| H | 1682 | 0.1% |
| R | 1682 | 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 256959 | |
| u | 85653 | 14.3% |
| k | 85653 | 14.3% |
| o | 85653 | 14.3% |
| w | 85653 | 14.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 171306 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2932971 | |
| Common | 171306 | 5.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 704861 | |
| M | 459316 | |
| A | 459316 | |
| L | 459316 | |
| n | 256959 | 8.8% |
| F | 243863 | 8.3% |
| u | 85653 | 2.9% |
| k | 85653 | 2.9% |
| o | 85653 | 2.9% |
| w | 85653 | 2.9% |
| Other values (4) | 6728 | 0.2% |
Common
| Value | Count | Frequency (%) |
| - | 171306 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3104277 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 704861 | |
| M | 459316 | |
| A | 459316 | |
| L | 459316 | |
| n | 256959 | 8.3% |
| F | 243863 | 7.9% |
| - | 171306 | 5.5% |
| u | 85653 | 2.8% |
| k | 85653 | 2.8% |
| o | 85653 | 2.8% |
| Other values (5) | 92381 | 3.0% |
age
Real number (ℝ≥0)
| Distinct | 99 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.41247158 |
| Minimum | 16 |
|---|---|
| Maximum | 115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 4.2 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 23 |
| Q1 | 28 |
| median | 34 |
| Q3 | 43 |
| 95-th percentile | 63 |
| Maximum | 115 |
| Range | 99 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 14.00167861 |
|---|---|
| Coefficient of variation (CV) | 0.3742516338 |
| Kurtosis | 5.989173921 |
| Mean | 37.41247158 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 2.002420326 |
| Sum | 20451565 |
| Variance | 196.047004 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 27868 | 5.1% |
| 31 | 27503 | 5.0% |
| 32 | 26907 | 4.9% |
| 29 | 26399 | 4.8% |
| 28 | 24123 | 4.4% |
| 34 | 23292 | 4.3% |
| 33 | 22965 | 4.2% |
| 27 | 22097 | 4.0% |
| 35 | 20400 | 3.7% |
| 25 | 19094 | 3.5% |
| Other values (89) | 306003 |
| Value | Count | Frequency (%) |
| 16 | 26 | < 0.1% |
| 17 | 81 | < 0.1% |
| 18 | 3435 | 0.6% |
| 19 | 6529 | 1.2% |
| 20 | 1589 | 0.3% |
| 21 | 5727 | 1.0% |
| 22 | 9441 | |
| 23 | 11905 | |
| 24 | 15017 | |
| 25 | 19094 |
| Value | Count | Frequency (%) |
| 115 | 12 | < 0.1% |
| 113 | 4 | < 0.1% |
| 112 | 1 | < 0.1% |
| 111 | 2 | < 0.1% |
| 110 | 400 | 0.1% |
| 109 | 35 | < 0.1% |
| 108 | 15 | < 0.1% |
| 107 | 23 | < 0.1% |
| 106 | 23 | < 0.1% |
| 105 | 6038 |
signup_method
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| basic | |
|---|---|
| 548 |
Length
| Max length | 8 |
|---|---|
| Median length | 5 |
| Mean length | 6.105614002 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3337640 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | |
|---|---|
| 2nd row | basic |
| 3rd row | |
| 4th row | basic |
| 5th row | basic |
Common Values
| Value | Count | Frequency (%) |
| basic | 344824 | |
| 201279 | ||
| 548 | 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| basic | 344824 | |
| 201279 | ||
| 548 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| b | 546103 | |
| a | 546103 | |
| c | 546103 | |
| o | 403654 | |
| s | 344824 | |
| i | 344824 | |
| e | 201827 | 6.0% |
| f | 201279 | 6.0% |
| k | 201279 | 6.0% |
| g | 1096 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3337640 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| b | 546103 | |
| a | 546103 | |
| c | 546103 | |
| o | 403654 | |
| s | 344824 | |
| i | 344824 | |
| e | 201827 | 6.0% |
| f | 201279 | 6.0% |
| k | 201279 | 6.0% |
| g | 1096 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3337640 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| b | 546103 | |
| a | 546103 | |
| c | 546103 | |
| o | 403654 | |
| s | 344824 | |
| i | 344824 | |
| e | 201827 | 6.0% |
| f | 201279 | 6.0% |
| k | 201279 | 6.0% |
| g | 1096 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3337640 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| b | 546103 | |
| a | 546103 | |
| c | 546103 | |
| o | 403654 | |
| s | 344824 | |
| i | 344824 | |
| e | 201827 | 6.0% |
| f | 201279 | 6.0% |
| k | 201279 | 6.0% |
| g | 1096 | < 0.1% |
language
Categorical
| Distinct | 25 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| en | |
|---|---|
| fr | 3420 |
| es | 2296 |
| de | 2236 |
| zh | 1694 |
| Other values (20) | 4700 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1093302 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | en |
|---|---|
| 2nd row | en |
| 3rd row | en |
| 4th row | en |
| 5th row | en |
Common Values
| Value | Count | Frequency (%) |
| en | 532305 | |
| fr | 3420 | 0.6% |
| es | 2296 | 0.4% |
| de | 2236 | 0.4% |
| zh | 1694 | 0.3% |
| it | 1042 | 0.2% |
| ko | 909 | 0.2% |
| ru | 705 | 0.1% |
| nl | 367 | 0.1% |
| pt | 359 | 0.1% |
| Other values (15) | 1318 | 0.2% |
Length
| Value | Count | Frequency (%) |
| en | 532305 | |
| fr | 3420 | 0.6% |
| es | 2296 | 0.4% |
| de | 2236 | 0.4% |
| zh | 1694 | 0.3% |
| it | 1042 | 0.2% |
| ko | 909 | 0.2% |
| ru | 705 | 0.1% |
| nl | 367 | 0.1% |
| pt | 359 | 0.1% |
| Other values (15) | 1318 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 536922 | |
| n | 532766 | |
| r | 4238 | 0.4% |
| f | 3447 | 0.3% |
| s | 2681 | 0.2% |
| d | 2358 | 0.2% |
| h | 1731 | 0.2% |
| z | 1694 | 0.2% |
| t | 1529 | 0.1% |
| i | 1094 | 0.1% |
| Other values (9) | 4842 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1093302 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 536922 | |
| n | 532766 | |
| r | 4238 | 0.4% |
| f | 3447 | 0.3% |
| s | 2681 | 0.2% |
| d | 2358 | 0.2% |
| h | 1731 | 0.2% |
| z | 1694 | 0.2% |
| t | 1529 | 0.1% |
| i | 1094 | 0.1% |
| Other values (9) | 4842 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1093302 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 536922 | |
| n | 532766 | |
| r | 4238 | 0.4% |
| f | 3447 | 0.3% |
| s | 2681 | 0.2% |
| d | 2358 | 0.2% |
| h | 1731 | 0.2% |
| z | 1694 | 0.2% |
| t | 1529 | 0.1% |
| i | 1094 | 0.1% |
| Other values (9) | 4842 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1093302 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 536922 | |
| n | 532766 | |
| r | 4238 | 0.4% |
| f | 3447 | 0.3% |
| s | 2681 | 0.2% |
| d | 2358 | 0.2% |
| h | 1731 | 0.2% |
| z | 1694 | 0.2% |
| t | 1529 | 0.1% |
| i | 1094 | 0.1% |
| Other values (9) | 4842 | 0.4% |
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| direct | |
|---|---|
| sem-brand | |
| sem-non-brand | |
| seo | 24102 |
| other | 15816 |
| Other values (3) | 21982 |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.801626632 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3718116 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | seo |
|---|---|
| 2nd row | direct |
| 3rd row | direct |
| 4th row | direct |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
| direct | 365381 | |
| sem-brand | 71891 | 13.2% |
| sem-non-brand | 47479 | 8.7% |
| seo | 24102 | 4.4% |
| other | 15816 | 2.9% |
| api | 13700 | 2.5% |
| content | 5501 | 1.0% |
| remarketing | 2781 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| direct | 365381 | |
| sem-brand | 71891 | 13.2% |
| sem-non-brand | 47479 | 8.7% |
| seo | 24102 | 4.4% |
| other | 15816 | 2.9% |
| api | 13700 | 2.5% |
| content | 5501 | 1.0% |
| remarketing | 2781 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 535732 | |
| r | 506129 | |
| d | 484751 | |
| t | 394980 | |
| i | 381862 | |
| c | 370882 | |
| n | 228111 | |
| - | 166849 | 4.5% |
| s | 143472 | 3.9% |
| a | 135851 | 3.7% |
| Other values (7) | 369497 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3551267 | |
| Dash Punctuation | 166849 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 535732 | |
| r | 506129 | |
| d | 484751 | |
| t | 394980 | |
| i | 381862 | |
| c | 370882 | |
| n | 228111 | |
| s | 143472 | 4.0% |
| a | 135851 | 3.8% |
| m | 122151 | 3.4% |
| Other values (6) | 247346 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 166849 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3551267 | |
| Common | 166849 | 4.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 535732 | |
| r | 506129 | |
| d | 484751 | |
| t | 394980 | |
| i | 381862 | |
| c | 370882 | |
| n | 228111 | |
| s | 143472 | 4.0% |
| a | 135851 | 3.8% |
| m | 122151 | 3.4% |
| Other values (6) | 247346 |
Common
| Value | Count | Frequency (%) |
| - | 166849 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3718116 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 535732 | |
| r | 506129 | |
| d | 484751 | |
| t | 394980 | |
| i | 381862 | |
| c | 370882 | |
| n | 228111 | |
| - | 166849 | 4.5% |
| s | 143472 | 3.9% |
| a | 135851 | 3.7% |
| Other values (7) | 369497 |
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| untracked | |
|---|---|
| linked | |
| omg | |
| tracked-other | 12068 |
| product | 3735 |
| Other values (2) | 329 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.210560303 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3941660 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | untracked |
|---|---|
| 2nd row | untracked |
| 3rd row | untracked |
| 4th row | untracked |
| 5th row | untracked |
Common Values
| Value | Count | Frequency (%) |
| untracked | 300967 | |
| linked | 119437 | 21.8% |
| omg | 110115 | 20.1% |
| tracked-other | 12068 | 2.2% |
| product | 3735 | 0.7% |
| marketing | 236 | < 0.1% |
| local ops | 93 | < 0.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| untracked | 300967 | |
| linked | 119437 | 21.8% |
| omg | 110115 | 20.1% |
| tracked-other | 12068 | 2.2% |
| product | 3735 | 0.7% |
| marketing | 236 | < 0.1% |
| local | 93 | < 0.1% |
| ops | 93 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 444776 | |
| d | 436207 | |
| k | 432708 | |
| n | 420640 | |
| t | 329074 | |
| r | 329074 | |
| c | 316863 | |
| a | 313364 | |
| u | 304702 | |
| o | 126104 | 3.2% |
| Other values (9) | 488148 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3929499 | |
| Dash Punctuation | 12068 | 0.3% |
| Space Separator | 93 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 444776 | |
| d | 436207 | |
| k | 432708 | |
| n | 420640 | |
| t | 329074 | |
| r | 329074 | |
| c | 316863 | |
| a | 313364 | |
| u | 304702 | |
| o | 126104 | 3.2% |
| Other values (7) | 475987 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12068 |
Space Separator
| Value | Count | Frequency (%) |
| 93 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3929499 | |
| Common | 12161 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 444776 | |
| d | 436207 | |
| k | 432708 | |
| n | 420640 | |
| t | 329074 | |
| r | 329074 | |
| c | 316863 | |
| a | 313364 | |
| u | 304702 | |
| o | 126104 | 3.2% |
| Other values (7) | 475987 |
Common
| Value | Count | Frequency (%) |
| - | 12068 | |
| 93 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3941660 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 444776 | |
| d | 436207 | |
| k | 432708 | |
| n | 420640 | |
| t | 329074 | |
| r | 329074 | |
| c | 316863 | |
| a | 313364 | |
| u | 304702 | |
| o | 126104 | 3.2% |
| Other values (9) | 488148 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| Web | |
|---|---|
| iOS | 28680 |
| Moweb | 7944 |
| Android | 5842 |
Length
| Max length | 7 |
|---|---|
| Median length | 3 |
| Mean length | 3.071811814 |
| Min length | 3 |
Characters and Unicode
| Total characters | 1679209 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Web |
|---|---|
| 2nd row | Web |
| 3rd row | Web |
| 4th row | Web |
| 5th row | Web |
Common Values
| Value | Count | Frequency (%) |
| Web | 504185 | |
| iOS | 28680 | 5.2% |
| Moweb | 7944 | 1.5% |
| Android | 5842 | 1.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
| web | 504185 | |
| ios | 28680 | 5.2% |
| moweb | 7944 | 1.5% |
| android | 5842 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 512129 | |
| b | 512129 | |
| W | 504185 | |
| i | 34522 | 2.1% |
| O | 28680 | 1.7% |
| S | 28680 | 1.7% |
| o | 13786 | 0.8% |
| d | 11684 | 0.7% |
| M | 7944 | 0.5% |
| w | 7944 | 0.5% |
| Other values (3) | 17526 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1103878 | |
| Uppercase Letter | 575331 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 512129 | |
| b | 512129 | |
| i | 34522 | 3.1% |
| o | 13786 | 1.2% |
| d | 11684 | 1.1% |
| w | 7944 | 0.7% |
| n | 5842 | 0.5% |
| r | 5842 | 0.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 504185 | |
| O | 28680 | 5.0% |
| S | 28680 | 5.0% |
| M | 7944 | 1.4% |
| A | 5842 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1679209 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 512129 | |
| b | 512129 | |
| W | 504185 | |
| i | 34522 | 2.1% |
| O | 28680 | 1.7% |
| S | 28680 | 1.7% |
| o | 13786 | 0.8% |
| d | 11684 | 0.7% |
| M | 7944 | 0.5% |
| w | 7944 | 0.5% |
| Other values (3) | 17526 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1679209 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 512129 | |
| b | 512129 | |
| W | 504185 | |
| i | 34522 | 2.1% |
| O | 28680 | 1.7% |
| S | 28680 | 1.7% |
| o | 13786 | 0.8% |
| d | 11684 | 0.7% |
| M | 7944 | 0.5% |
| w | 7944 | 0.5% |
| Other values (3) | 17526 | 1.0% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 4.2 MiB |
| NDF | |
|---|---|
| GB | |
| ES | |
| US | |
| NL | |
| Other values (7) |
Length
| Max length | 5 |
|---|---|
| Median length | 2 |
| Mean length | 2.346374561 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1282648 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NDF |
|---|---|
| 2nd row | US |
| 3rd row | other |
| 4th row | US |
| 5th row | US |
Common Values
| Value | Count | Frequency (%) |
| NDF | 54850 | |
| GB | 52705 | |
| ES | 50518 | |
| US | 47704 | |
| NL | 47597 | |
| PT | 47100 | |
| other | 44832 | |
| FR | 43937 | |
| CA | 42544 | |
| IT | 40225 | |
| Other values (2) | 74639 |
Length
| Value | Count | Frequency (%) |
| ndf | 54850 | |
| gb | 52705 | |
| es | 50518 | |
| us | 47704 | |
| nl | 47597 | |
| pt | 47100 | |
| other | 44832 | |
| fr | 43937 | |
| ca | 42544 | |
| it | 40225 | |
| Other values (2) | 74639 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 102447 | 8.0% |
| F | 98787 | 7.7% |
| S | 98222 | 7.7% |
| D | 92685 | 7.2% |
| E | 88353 | 6.9% |
| T | 87325 | 6.8% |
| U | 84508 | 6.6% |
| A | 79348 | 6.2% |
| B | 52705 | 4.1% |
| G | 52705 | 4.1% |
| Other values (10) | 445563 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1058488 | |
| Lowercase Letter | 224160 | 17.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 102447 | |
| F | 98787 | |
| S | 98222 | |
| D | 92685 | |
| E | 88353 | 8.3% |
| T | 87325 | 8.2% |
| U | 84508 | 8.0% |
| A | 79348 | 7.5% |
| B | 52705 | 5.0% |
| G | 52705 | 5.0% |
| Other values (5) | 221403 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 44832 | |
| t | 44832 | |
| h | 44832 | |
| e | 44832 | |
| r | 44832 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1282648 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 102447 | 8.0% |
| F | 98787 | 7.7% |
| S | 98222 | 7.7% |
| D | 92685 | 7.2% |
| E | 88353 | 6.9% |
| T | 87325 | 6.8% |
| U | 84508 | 6.6% |
| A | 79348 | 6.2% |
| B | 52705 | 4.1% |
| G | 52705 | 4.1% |
| Other values (10) | 445563 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1282648 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 102447 | 8.0% |
| F | 98787 | 7.7% |
| S | 98222 | 7.7% |
| D | 92685 | 7.2% |
| E | 88353 | 6.9% |
| T | 87325 | 6.8% |
| U | 84508 | 6.6% |
| A | 79348 | 6.2% |
| B | 52705 | 4.1% |
| G | 52705 | 4.1% |
| Other values (10) | 445563 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| signup_flow | days_from_first_active_until_account_created | days_from_account_created_until_first_booking | day_first_booking | day_of_week_first_booking | year_account_created | day_account_created | day_of_week_account_created | week_of_year_account_created | gender | age | signup_method | language | affiliate_channel | first_affiliate_tracked | signup_app | country_destination | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 732 | 1496 | 29 | 0 | 2011 | 25 | 2 | 21 | MALE | 38 | en | seo | untracked | Web | NDF | |
| 1 | 3 | 476 | -57 | 2 | 0 | 2010 | 28 | 1 | 39 | FEMALE | 56 | basic | en | direct | untracked | Web | US |
| 2 | 0 | 765 | 278 | 8 | 5 | 2011 | 5 | 0 | 49 | FEMALE | 42 | en | direct | untracked | Web | other | |
| 3 | 0 | 280 | -208 | 18 | 3 | 2010 | 14 | 1 | 37 | -unknown- | 41 | basic | en | direct | untracked | Web | US |
| 4 | 0 | 0 | 3 | 5 | 1 | 2010 | 2 | 5 | 53 | FEMALE | 46 | basic | en | other | untracked | Web | US |
| 5 | 0 | 0 | 10 | 13 | 2 | 2010 | 3 | 6 | 53 | FEMALE | 47 | basic | en | direct | omg | Web | US |
| 6 | 0 | 0 | 206 | 29 | 3 | 2010 | 4 | 0 | 1 | FEMALE | 50 | basic | en | other | untracked | Web | US |
| 7 | 0 | 0 | 0 | 4 | 0 | 2010 | 4 | 0 | 1 | -unknown- | 46 | basic | en | other | omg | Web | US |
| 8 | 0 | 0 | 2 | 6 | 2 | 2010 | 4 | 0 | 1 | FEMALE | 36 | basic | en | other | untracked | Web | US |
| 9 | 0 | 0 | 2001 | 29 | 0 | 2010 | 5 | 1 | 1 | FEMALE | 47 | basic | en | other | untracked | Web | NDF |
Last rows
| signup_flow | days_from_first_active_until_account_created | days_from_account_created_until_first_booking | day_first_booking | day_of_week_first_booking | year_account_created | day_account_created | day_of_week_account_created | week_of_year_account_created | gender | age | signup_method | language | affiliate_channel | first_affiliate_tracked | signup_app | country_destination | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 546641 | 0 | 0 | 1 | 19 | 3 | 2012 | 18 | 2 | 51 | FEMALE | 25 | basic | en | direct | untracked | Web | other |
| 546642 | 25 | 0 | 74 | 19 | 4 | 2014 | 5 | 2 | 12 | FEMALE | 30 | basic | en | direct | untracked | iOS | other |
| 546643 | 0 | 0 | 1 | 20 | 2 | 2013 | 19 | 1 | 47 | MALE | 25 | basic | ko | sem-non-brand | omg | Web | other |
| 546644 | 0 | 0 | 1 | 2 | 2 | 2013 | 1 | 1 | 40 | FEMALE | 41 | basic | en | direct | omg | Web | other |
| 546645 | 0 | 0 | 1 | 22 | 4 | 2012 | 21 | 3 | 12 | FEMALE | 33 | basic | en | direct | linked | Web | other |
| 546646 | 0 | 0 | 0 | 21 | 0 | 2014 | 21 | 0 | 17 | -unknown- | 41 | basic | en | sem-brand | omg | Web | other |
| 546647 | 0 | 0 | 55 | 26 | 5 | 2012 | 2 | 0 | 27 | MALE | 28 | en | sem-brand | omg | Web | other | |
| 546648 | 0 | 0 | 22 | 12 | 3 | 2014 | 21 | 2 | 21 | -unknown- | 30 | basic | en | direct | untracked | Web | other |
| 546649 | 0 | 0 | 2 | 15 | 2 | 2014 | 13 | 0 | 3 | FEMALE | 30 | basic | en | direct | linked | Web | other |
| 546650 | 1 | 0 | 1 | 23 | 0 | 2012 | 22 | 6 | 29 | FEMALE | 24 | basic | en | direct | linked | Web | other |
Most frequently occurring
| signup_flow | days_from_first_active_until_account_created | days_from_account_created_until_first_booking | day_first_booking | day_of_week_first_booking | year_account_created | day_account_created | day_of_week_account_created | week_of_year_account_created | gender | age | signup_method | language | affiliate_channel | first_affiliate_tracked | signup_app | country_destination | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 6749 | 0 | 0 | 0 | 15 | 5 | 2013 | 15 | 5 | 24 | FEMALE | 31 | en | direct | untracked | Web | PT | 132 | |
| 113 | 0 | 0 | 0 | 1 | 2 | 2012 | 1 | 2 | 9 | FEMALE | 40 | basic | en | sem-brand | linked | Web | PT | 127 |
| 2531 | 0 | 0 | 0 | 6 | 3 | 2013 | 6 | 3 | 23 | FEMALE | 25 | basic | en | direct | untracked | Web | PT | 112 |
| 11243 | 0 | 0 | 0 | 25 | 5 | 2013 | 25 | 4 | 4 | FEMALE | 25 | en | direct | linked | Web | PT | 107 | |
| 9966 | 0 | 0 | 0 | 23 | 0 | 2014 | 23 | 0 | 26 | FEMALE | 21 | en | seo | untracked | Web | PT | 97 | |
| 12049 | 0 | 0 | 0 | 27 | 3 | 2013 | 27 | 3 | 26 | FEMALE | 45 | basic | en | sem-non-brand | omg | Web | PT | 96 |
| 913 | 0 | 0 | 0 | 3 | 0 | 2013 | 3 | 0 | 10 | -unknown- | 35 | basic | en | sem-brand | omg | Web | PT | 95 |
| 1041 | 0 | 0 | 0 | 3 | 1 | 2013 | 3 | 1 | 23 | FEMALE | 19 | basic | en | direct | untracked | Web | PT | 90 |
| 1954 | 0 | 0 | 0 | 5 | 1 | 2013 | 5 | 1 | 10 | MALE | 25 | en | sem-brand | omg | Web | PT | 86 | |
| 5808 | 0 | 0 | 0 | 13 | 5 | 2013 | 13 | 5 | 27 | -unknown- | 36 | basic | en | direct | untracked | Web | PT | 86 |